Learning to Play General-Sum Games against Multiple Boundedly Rational Agents

نویسندگان

چکیده

We study the problem of training a principal in multi-agent general-sum game using reinforcement learning (RL). Learning robust policy requires anticipating worst possible strategic responses other agents, which is generally NP-hard. However, we show that no-regret dynamics can identify these worst-case poly-time smooth games. propose framework uses this evaluation method for efficiently RL. This be extended to provide robustness boundedly rational agents too. Our motivating application automated mechanism design: empirically demonstrate our learns mechanisms both matrix games and complex spatiotemporal In particular, learn dynamic tax improves welfare simulated trade-and-barter economy by 15%, even when facing previously unseen RL taxpayers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cooperative oligopoly games with boundedly rational firms

We analyze cooperative Cournot games with externalities. Due to cognitive constraints, the members of a coalition cannot accurately predict the coalitional actions of the non-members. Thus, they compute their value following simple heuristics. In particular, they assign various non-equilibrium probability distributions over the outsiders’ set of partitions. We construct the value function of a ...

متن کامل

Agent-based Modeling with Boundedly Rational Agents

This chapter introduces an agent-based modeling framework for reproducing micro behavior in economic experiments. It gives an overview of the theoretical concept which forms the foundation of the framework as well as short descriptions of two exemplary models based on experimental data. The heterogeneous agents are endowed with a number of attributes like cooperativeness and employ more or less...

متن کامل

Repeated Moral Hazard with Boundedly Rational Agents

In this paper we consider a situation where a number of identical myopic agents enter a long term contract with a principal. The actions of the agents cannot be observed, and the principal ooers the agents a payment scheme where payments from the principal to the agents are based on the observable outcome in the corresponding period. Whereas the principal knows the distribution of outcomes, giv...

متن کامل

Competition over agents with boundedly rational expectations

I study a market model in which profit-maximizing firms compete in multidimensional pricing strategies over a consumer, who is limited in his ability to grasp such complicated objects and therefore uses a sampling procedure to evaluate them. Firms respond to increased competition with an increased effort to obfuscate, rather than with more competitive pricing. As a result, consumer welfare is n...

متن کامل

A Model of Boundedly Rational “Neuro” Agents

We consider a model in which each agent in a population chooses one of two options. Each agent does not know what the available options are and can choose an option only after observing another agent who has already chosen that option. In addition, the agents’ preferences over the two options are correlated. An agent can either imitate an observed agent or wait until he meets two agents who mad...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i10.26391